A Proposed Standard for Matrix Metadata∗

نویسندگان

  • Victor Eijkhout
  • Erika Fuentes
چکیده

We propose a standard for storing metadata describing numerical matrix data. The standard consists of an XML file format and an internal data format. We give the abstract description of the XML storage format, APIs (Application Programmer Interfaces) for access to the stored data inside a program, and a core set of categories of data to be stored. The standard defines an open-ended format, allowing for other parties to define additional metadata categories to be stored within this framework. 1 General discussion Matrix storage formats, both file formats and data structures, traditionally limit themselves to specifying only the minimally necessary description of the data: the matrix size, and the matrix elements themselves with a fairly explicit description of the nonzero structure for sparse matrices. However, we can associate with matrix data any number of derived properties, such as norms, spectral properties, or graph properties in the sparse case. There is no standard way of storing such data, making interoperability hard between software modules written by different authors. Such interoperability would be valuable in a number of contexts. For instance, linear algebra algorithms often need, or at least have a use for, difficult to compute matrix statistics, such as condition number estimates. Thus, the total algorithm consists of two disparate modules: one analyser that estimates the numerical quantity, and the algorithm proper which uses this quantity to fine-tune its workings. While any ad hoc fit can be made between such analysis-producing and analysis-consuming software, a more general solution would make componentization of such software possible. We note that analysis modules need not limit themselves, as in the example just given, to calculation or estimation of numerical quantities from the problem data. We can also envision that an application annotate its data before passing it to the numerical routine. Thus, information like the nature of a differential equation or its discretisation can be preserved, and translated to useful numerical information by a different kind of analysis module. There is also use for a more permanent storage format of numerical metadata. We will argue both points, the programmatic and the storage aspects of metadata, in detail below. The existence of a metadata standard for numerical data – we limit ourselves here to matrix data, though extension of these ideas to other fields is natural (see for instance [2, ch. 7]) – makes the following software functionalities possible. • First of all, it allows numerical algorithms to request metadata, not easily derivable from more traditional inputs, that will assist in the computation process. ∗ This work was funded in part by the Los Alamos Computer Science Institute through the subcontract # R71700J-29200099 from Rice University, and by the National Science Foundation under grant # 0203984.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Metadata Enrichment for Automatic Data Entry Based on Relational Data Models

The idea of automatic generation of data entry forms based on data relational models is a common and known idea that has been discussed day by day more than before according to the popularity of agile methods in software development accompanying development of programming tools. One of the requirements of the automation methods, whether in commercial products or the relevant research projects, ...

متن کامل

مقایسۀ مدخل‌های استانداردهای فراداده‌ای در پایگاه‌های نسخه‌های خطی فارسی با مدخل‌های استانداردهای فراداده‌ای در پایگاه‌های خارج از ایران در پوشش مدخل‌های نسخه‌های خطی

Purpose: The present research aims at studying the use of metadata standards in Persian manuscripts databases, and the types and frequencies of these standards in the Optical Character Recognition (OCR) procedure of these databases. Methodology: Research population consists of four Persian databases and 12 Latin databases. The research data is gathered through a checklist, using descriptive su...

متن کامل

استانداردهای آرشیوی، در نرم‌افزارهای دسترسی آزاد و پیشنهاد نرم‌افزار مناسب برای مراکز آرشیوی داخلی

The purpose of this study is Study of Descriptive Metadata Standards in Archival open source software, to determine the most appropriate descriptive metadata standard (s) and also Encoder Software support of these standards. The approach of present study is combination and library methods, Delphi and descriptive survey are used. Data gathering in library study is fiche, in the Delphi method is ...

متن کامل

شناسایی روابط کتابشناختی در فهرست کتابخانه ملی ایران مبتنی بر الگوی ملزومات کارکردی پیشینه‌های کتابشناختی (اف آر بی آر): گام نخست در بازنمون شبکه دانش انتشارات ایرانی-اسلامی

The aim of this study is to find out the bibliographic relationships between the metadata records in the National Library and Archives of Iran (NLAI) according to FRBR model, in order to represent the Knowledge network of Iranian-Islamic publications. To achieve this objective, the content analysis method was used. The study population includes metadata records for books in NLAI for four biblio...

متن کامل

A Proposed Standard for Numerical Metadata∗

We propose a standard for generating and storing metadata describing numerical problems, in particular properties of matrices and linear systems. The standard comprises a storage and a generation component. The storage consists of an XML file format and an internal data format with various access routines; the generation standard describes a format for software that produces metadata. We give t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003